Force-pushed: 95af585 to 99ab63b
Pull request overview
Adds a new remote seed dataset loader for the SALAD-Bench HuggingFace dataset, making it available through PyRIT’s automatic SeedDatasetProvider discovery and documenting it in the dataset-loading guide.
Changes:
- Added `_SaladBenchDataset` remote loader that fetches SALAD-Bench from HuggingFace and converts rows into `SeedPrompt`s.
- Registered the loader for auto-discovery via `pyrit.datasets.seed_datasets.remote.__init__`.
- Added unit tests and updated the “Loading Built-in Datasets” notebook to show the new dataset name.
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| `pyrit/datasets/seed_datasets/remote/salad_bench_dataset.py` | New HuggingFace-backed loader that maps SALAD-Bench entries into SeedDataset/SeedPrompt. |
| `pyrit/datasets/seed_datasets/remote/__init__.py` | Imports/exports `_SaladBenchDataset` so it’s registered and discoverable. |
| `tests/unit/datasets/test_salad_bench_dataset.py` | Unit tests validating dataset fetching and config passthrough behavior. |
| `doc/code/datasets/1_loading_datasets.ipynb` | Documentation notebook updated to reflect the new dataset in the available list (but currently includes executed outputs/metadata). |
Comments suppressed due to low confidence (1)
pyrit/datasets/seed_datasets/remote/salad_bench_dataset.py:74
- The `authors` list formatting is inconsistent with other remote dataset loaders and is hard to read (and likely exceeds the repo’s 120-char line length). Please format the authors list across multiple lines (one author per line), like other dataset loaders, for readability and consistent styling.
dataset_name=self.hf_dataset_name,
config=self.config,
Force-pushed: 99ab63b to 7db0e9c
Add remote dataset loader for SALAD-Bench (walledai/SaladBench), a hierarchical safety benchmark with ~30k prompts organized into 6 domains, 16 tasks, and 65+ categories (ACL 2024).
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Force-pushed: 7db0e9c to bbc66cf
Pull request overview
Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.
Comments suppressed due to low confidence (1)
doc/code/datasets/1_loading_datasets.ipynb:242
- The notebook metadata was updated to a different local Python version. To avoid unnecessary diffs across environments, consider reverting/normalizing kernel metadata (or stripping it) in committed docs notebooks.
"version": "3.13.5"
The HF dataset identifier is now a class constant `HF_DATASET_NAME` instead of a constructor parameter, consistent with other loaders.
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Pull request overview
Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.
Comments suppressed due to low confidence (1)
doc/code/datasets/1_loading_datasets.ipynb:200
- This notebook output includes a DeprecationWarning with a user-specific temporary file path (`C:\\Users\\...\\AppData\\Local\\Temp\\...`). Please clear or sanitize this output (and ideally avoid emitting the warning in the example) so the docs are reproducible and don't embed local filesystem paths.
"C:\\Users\\romanlutz\\AppData\\Local\\Temp\\ipykernel_40808\\4021500943.py:10: DeprecationWarning: is_objective parameter is deprecated since 0.13.0. Use seed_type='objective' instead.\n",
" memory.get_seeds(harm_categories=[\"illegal\"], is_objective=True)\n"
]
Wrapping in Jinja2 raw tags preserves original dataset text that
may contain {{ }} or {% %} syntax. Also precomputes loop constants.
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
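As a rough illustration of the raw-tag wrapping described in this commit message, the sketch below shows how dataset text containing `{{ }}` can survive a Jinja2 render unchanged. The helper names `wrap_in_raw` and `render_verbatim` are hypothetical and not taken from the PR; the actual loader may do this differently.

```python
RAW_OPEN = "{% raw %}"
RAW_CLOSE = "{% endraw %}"


def wrap_in_raw(text: str) -> str:
    """Wrap text in Jinja2 raw tags so a later render emits it verbatim.

    Hypothetical helper for illustration only.
    """
    # A literal endraw tag inside the text would close the raw block early.
    # This sketch only checks the exact spelling above and simply refuses it.
    if RAW_CLOSE in text:
        raise ValueError("text already contains an endraw tag")
    return f"{RAW_OPEN}{text}{RAW_CLOSE}"


def render_verbatim(text: str) -> str:
    """Render wrapped text with Jinja2 and return the result."""
    # jinja2 is a third-party package; it is imported lazily so that
    # wrap_in_raw stays usable without it.
    from jinja2 import Template

    return Template(wrap_in_raw(text)).render()


if __name__ == "__main__":
    prompt = "Fill in the blank: {{ secret }} and {% if x %}"
    # Without the raw tags, Jinja2 would try to interpret the braces
    # (and fail on the unterminated if block).
    print(render_verbatim(prompt))
```

The wrapping step is a plain string operation, so it can be precomputed once per row before any template rendering happens.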
{
 "name": "stderr",
 "output_type": "stream",
 "text": [
  "\r\n",
  "Loading datasets - this can take a few minutes: 0%| | 0/49 [00:00<?, ?dataset/s]"
  "\r",
  "Loading datasets - this can take a few minutes: 0%| | 0/50 [00:00<?, ?dataset/s]"
 ]
This notebook cell output includes run-specific progress bar output (dataset counts/timings) captured in stderr. Please clear outputs before committing to keep docs deterministic and reduce noisy diffs.
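One possible way to clear outputs before committing is sketched below. It assumes the notebook follows the nbformat v4 JSON layout, where code cells carry `outputs` and `execution_count` fields. The function name `clear_outputs` is made up for this example, and the repository may already have its own tooling for this.

```python
import json


def clear_outputs(notebook: dict) -> dict:
    """Remove outputs and execution counts from every code cell, in place."""
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") == "code":
            cell["outputs"] = []
            cell["execution_count"] = None
    return notebook


if __name__ == "__main__":
    # A tiny in-memory notebook standing in for a real .ipynb file.
    sample = {
        "cells": [
            {"cell_type": "markdown", "source": ["# Title"]},
            {
                "cell_type": "code",
                "execution_count": 3,
                "source": ["print('hi')"],
                "outputs": [
                    {"name": "stdout", "output_type": "stream", "text": ["hi\n"]}
                ],
            },
        ],
        "metadata": {},
        "nbformat": 4,
        "nbformat_minor": 5,
    }
    print(json.dumps(clear_outputs(sample), indent=1))
```

For a file on disk, the same function could be applied between `json.load` and `json.dump`; markdown cells and notebook metadata are left untouched.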